Probability Density Function of 
Kerr Effect Phase Noise 



Keang-Po Ho 

StrataLight Communications, Campbell, CA 95008 

kpho@stratalight. com 

The probability density function of Kerr effect phase noise, often called the 
Gordon- Mo llenauer effect, is derived analytically. The Kerr effect phase noise 
can be accurately modeled as the summation of a Gaussian random variable 
and a noncentral chi-square random variable with two degrees of freedom. 
Using the received intensity to correct for the phase noise, the residual Kerr 
effect phase noise can be modeled as the summation of a Gaussian random 
variable and the difference of two noncentral chi-square random variables 
with two degrees of freedom. The residual phase noise can be approximated 
by Gaussian distribution better than the Kerr effect phase noise without 
correction. © 2008 Optical Society of America 

OCIS codes: 190.3270, 060.5060, 060.1660, 190.4370. 
1. Introduction 

Gordon and Mollenauei^ showed that when optical amplifiers are used to compensate 
for fiber loss, the interaction of amplifier noise and the Kerr effect causes phase noise, 
often called the Gordon- Mollenauer effect or nonlinear phase noise. By broadening the 



signal linewidthP Kerr effect phase noise degrades both phase-shifted keying (PSK) 
and differential phase-shift keying (DPSK) systems that have renewed attention re- 
cently.^^ Because the Kerr effect phase noise is correlated with the received inten- 
sity, the received intensity can be used to correct the Kerr effect phase noise ISiZMSI The 
transmission distance can be doubled if the Kerr effect phase noise is the dominant 
impairment P^i 

Usually, the performance of the system is estimated based on the variance of 
the Kerr effect phase noise IHSIZISI The probability density function (p.d.f.) is required 
to better understand the system and evaluates the system performance. This paper 
provides an analytical expression of the p.d.f. for the Kerr effect phase noise with™ 
and without the correction by the received intensity. The characteristic functions 
are first derived analytically and the p.d.f. 's are the inverse Fourier transform of the 
characteristic functions. 

2. Probability Density Function 

For simplicity and without loss of generality, assume that the total Kerr effect phase 
noise iiPHE 

0ln = 1-4 + ni| 2 + \A + m + n 2 \ 2 + ■ ■ ■ + \A + m + ■ ■ ■ + n N \ 2 , (1) 

where A is a real number representing the amplitude of the transmitted signal, = 
%k + Wki k — 1, . . . , N, are the optical amplifier noise introduced into the system at 
the kth fiber span, are independent identically distributed (i.i.d.) complex zero- 
mean circular Gaussian random variables with E{xl} = E{yl} = E{\nk\ 2 }/2 = a 2 , 
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where a 2 is the noise variance per dimension per span. The product of fiber nonlinear 
coefficient and the effective length per span 7L e ff is ignored in (|T} for simplicity! 1 ^ * 8 * 
First, we consider the random variable of 



tpi = \A + xi| 2 + \A + x x + x 2 \ 2 + --- + \A + xi + --- + x N \ 2 . (2) 
The overall Kerr effect phase noise (JTJ) is 0ln = <fi + ^2, where 

( P2 — Vi + 1 2/i + 2/2 1 2 H h H h (3) 

is independent of y?i and has a p.d.f. equal to that of (pi when A = 0. The random 
variable of (0) can be expressed as 



V 9 1 = jVA 2 + 2Aw T x + x T Cx, 



(4) 



where w = {N, N — 1, ... ,2, 1} T , x = {xi,x 2 , ■ ■ ■ , xat} t , and the covariance matrix 
C = M T M with 



f ••• 
f f ■■■ 

M = 1 1 1 ... • (5) 

1 1 1 ••• 1 

The p.d.f. of x is (27rcr 2 )~^ exp ( — x T x/2a 2 j . The characteristic function of <p 



= ^{exp O^i)}, is 
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(27TCT 2 



2juAw x — x Tx 



dx, 



(6) 



where T = X/ (2a 2 ) — jz/C and X is an iV x N identity matrix. Using the relationship 
of 



- 2juAw T x = (x- jvAT^vjfv (x - jvAT~ l w) + v 2 A 2 w T T~ l w, (7) 



with some algebra, the characteristic function (JHJ) is 



exp 



(2a 2 )fdet[r] 1 /2 

where det[-] is the determinant of a matrix. The characteristic function (JHJ) is 



exp 



juNA 2 - 2a 2 u 2 A 2 w T (l - 2jVcx 2 C)~ 1 w 



det [X — 2jua 2 C]^ 

Substitute A = into ©, the characteristic function of <£>2 is (v) = det [X — 2jVcr 2 C] 
The characteristic function of 0ln is ^0 LN (^) = ^ ipi (h , )^ lfi2 (i'), or 



(9) 



exp 



juNA 2 - 2o 2 v 2 A 2 w T (T - 2jva 2 Cy l w 



(10) 



det [X-2jVa 2 C] 

If the covariance matrix C has eigenvalues and eigenvectors of Vf., k = 1, 2, . . . , N, 
respectively, the characteristic function (fTUj) becomes 



exp 



N 



juNA 2 - 2a 2 v 2 A 2 £ - 
fc=i 1 



(vlwf 



2jva 2 \ k 



N 

n (i - 2j^ 2 a, 



and can be rewritten to 



N 



n.r 



cxp 



k=\ 



1 — 2jva 2 \k 



(12) 



2jva 2 \ k 

From the characteristic function (|T2"|) . the random variable of 0ln © is the summa- 
tion of N independently distributed noncentral x _sc i uare random variables with two 
degrees of freedomP Without going into detail, the matrix 



C 



-1 



-1 







-1 2 -1 ••• 



-1 2 







(13) 



■•■ -1 2 

is approximately a Toepliz matrix for the series of . . . , 0, —1, 2, —1, 0, . . . For large N, 

the eigenvalues of the covariance matrix of C are asymptotically equal tcP^ 



Afe 



1 — COS 



\2k + l)^ 
2iV 



4 sin 2 



'(2k- l)n 
AN 



k = 1 iV. 



(14) 



The values of (11411 are the discrete Fourier transform of each row of the matrix C 1 . 



The eigenvalues of the covariance matrix of C are all positive and multiple to unity. 
With the correction of phase noise using received intensity,* 6 1 7 1 8 * the residual non- 



linear phase noise is 



0res = \A + rii\ 2 + \A + ni+ n 2 \ 2 H V\A + nl^ hnjv_i 
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-(a opt -l)|A + m + ■■■ + n N \ 2 , (15) 

As from the Appendix, a opt ~ (N + l)/2 is the optimal scale factor to correct the 
Kerr effect phase noise (JTJ) using the received intensity of \A + n\ + • • • + n^\ 2 . The 
random variable corresponding to (pi Pj) becomes 

(N - a opt )A 2 + 2Aw r T x + x T C r x, (16) 
where w r = w — a opt •{!,!,..., 1} T and 



C r = (M - C) T {M -C)- (o; pt - 1)£ T 4 



;i7) 



where 



C 



••• 



••• 



1 1 ■■• 1 1 

Following the procedure from (J3J) to (fTUj) . the characteristic function of 0res is 



exp 



0RES * 



-ji/(JV - a opt )A 2 - 2a 2 u 2 A 2 w r T {l - 2jva 2 C r )~ 1 w r 



det [X — 2jua 2 C r 



(19) 



The characteristic functions of </>res i n the form of eigenvalues and eigenvectors are 
similar to that of (|TT| and (|T2|) . The characteristic functions of 0res has the same 
expression as ()12j1 using a new set of eigenvalues and eigenvectors of the covariance 



matrix C r and the vector of w r . 



Except the first and last rows, the matrix C" 1 is also approximately a Toepliz 
matrix for the series of . . . , 0, —1, 2, —1, 0, . . . For large N, the eigenvalues of C r are 
asymptotically equal to 

N 

k = 2,...,N, Ai (20) 

k=2 

Other than the largest one in absolute value, the eigenvalues of C r are all positive. All 
eigenvalues of the covariance C r sum to approximately zero and multiple to a opt — 1 ~ 
(N-l)/2. 

3. Numerical Results and Random Variable Models 

The p.d.f.'s of both 0nl (JU) and 0res f|15|) can be calculated by taking the inverse 
Fourier transform of the corresponding characteristic functions of \E^ NL (V) ([lUp and 
^res^) (HH ) respectively. Fig. [T] shows the p.d.f. of NL (Q) and RE s (US)- Fig. 
^ is plotted for the case that the optical signal-to-noise ratio po = A 2 /(2Na 2 ) = 
18, corresponding to an error probability of 10~ 9 if the amplifier noise is the only 
impairment. The number of span is iV = 32. The x-axis is normalized with respect to 
NA 2 , approximately equal to the mean Kerr effect phase noise from the Appendix. 

Fig. E can confirm that using the received intensity to correct for Kerr effect 
phase noise, the standard deviation of Kerr effect phase noise can be reduced by a 
factor of two! 6 * 7 * 8 i The Appendix shows that the variance of nonlinear phase noise can 
be reduced by approximately a factor of four. 

From the characteristic function of (|T2")) , the random variables of both 0nl © and 
0res (EH! can be modeled as the combination of = 32 independently distributed 
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1 

A*- 



4shr 



(k — 1.25)7r 
2(A- 1) 



noncentral %-square random variables with two degrees of freedom. Some studies^^ 
implicitly assume a Gaussian distribution by using the Q-factor to characterize the 
random variables. When many independently distributed random variables with more 
or less the same variance are summed (or subtracted) together, the summed random 
variable approaches the Gaussian distribution. For the characteristic function of ()12j) . 
the Gaussian assumption is valid only if the eigenvalues A& are more or less the same. 
From (|14p. the largest eigenvalue Ai of the covariance matrix C is about nine times 
larger than the second largest eigenvalue A 2 . From (j2Tf|) . the two largest eigenvalues 
Ai and A2 of the covariance matrix C r are about 5.5 times larger than the third largest 
eigenvalue A3. The approximation of (fl"4"|) is accurate within 3.2% for N = 32. The 
approximation of (ffilj) is not as good as that for (fT4"J) and accurate within 10% for 
N = 32. 

While the Gaussian assumption for both NL (HJ) and 0res m ay n °t be valid, 
other than the noncentral x _sc l uare random variables with two degrees of freedom 
corresponds to some large eigenvalues, the other random variables should sum to 
Gaussian distribution. By modeling the summation of random variables with smaller 
eigenvalues as Gaussian distribution, the nonlinear phase noise of (j!2J) can be modeled 
as a summation of two or three instead of N = 32 independently distributed random 
variables. 

Note that the variance of the noncentral x _sc l uare random variables with two 
degrees of freedom in (jP2j) is 4<7 4 A 2 +4A 2 (^w) 2 PWhile the above reasoning just takes 
into account the contribution from the eigenvalue of A& but ignores the contribution 
from the eigenvector Vk, numerical results show that the variance of each individual 
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noncentral x _s qnare random variable increases with the corresponding eigenvalue of 
Afc. Later part of this paper also validates the argument. 

From Fig. [TJ the p.d.f. of 0nl has significant difference with that of a Gaussian 
distribution. Fig. |21 divides the p.d.f. of 0nl into the convolution of two parts. The 
first part has no observable difference with a Gaussian p.d.f. and corresponds to the 
second largest to the smallest eigenvalues, Afc, k = 2, . . . , N, of the characteristic 
function (jl'2J) . The second part is a noncentral x _s qnare p.d.f. with two degrees of 
freedom and corresponds to the largest eigenvalue Ai, where a 2 \i ~ 2/(7r 2 po) • NA 2 . 
The p.d.f. of 0nl in Fig. ^ is also plotted in Fig. El for comparison. The mean and 
variance of the first part Gaussian random variable are J2k=2 ^(v^w) 2 / Afc+2a 2 Afc and 
4 Hk=2 ' 4 ^l + ^ 2 (^fc ti) 2 , respectively. The second part noncentral x-square p.d.f. with 
two degrees of freedom has a variance parameter of <r 2 Ai and noncentrality parameter 
of A 2 (^w) 2 /X 1 . g 

To verify that the modeling in Fig. 12 is accurate, the cumulative tail probabilities 
are calculated by J^^p^dC, and 1% p(£)d£, where is the p.d.f. Fig. El shows 
the cumulative tail probabilities as a function of Q-factor for 0nl, defined as Q = 
(0nl — 0nl)/o"0 nl , where 0nl and <j^ nl are the mean and variance of the Kerr effect 
phase noise given in the Appendix. Using Gaussian approximation, this definition 
of Q-factor gives the same tail probability or error probabilitjJ™ f ±erfc(|Q|/\/2), 
where erfc(-) is the complementary error function. Fig. El shows the cumulative tail 
probabilities calculated by numerical integration according to (J1(J)) as circle, the model 
as the summation of a Gaussian and a noncentral x-squarerandom variable with two 
degrees of freedom of Fig. [2] as solid lines, and the Gaussian assumption as dotted 
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lines. From Fig. HU the Gaussian approximation by Q-factor is not accurate, especially 
for the tail probability for less than the mean. From Fig. the Kerr effect phase noise 
can be modeled very accurately as the summation of a Gaussian random variable and 
a noncentral %-square random variable with two degrees of freedom. From Fig. |21 
the noncentral %-square random variable with two degrees of freedom corresponding 
to Ai has a very large variance such that the p.d.f. of 0ln in Fig- Ehas significant 
difference with a Gaussian p.d.f. 

Instead of the combination of iV = 32 noncentral %-square random variables 
with two degrees of freedom, similar to the decomposition of Fig. El the random 
variable of 0res can be modeled as the summation of a Gaussian random variable 
and the difference of two noncentral %-square random variables with two degrees of 
freedom. Fig. HJ shows that the p.d.f. of 0res as the convolution of a Gaussian p.d.f. 
and two noncentral %-square p.d.f. 's with two degrees of freedom. The two noncentral 
X-square random variables correspond to the two largest eigenvalues of the covariance 
matrix C r with more or less the same magnitude but different signs. The Gaussian 
random variable corresponds to the summation of iV — 2 noncentral %-square random 
variables with two degrees of freedom for the eigenvalues of X 3 , . . . , \ N . Because the 
variance parameter of <r 2 Ai is negative, the corresponding random variable in (|12j) is 
the negative of a noncentral %-square random variable with two degrees of freedom. 
The p.d.f. corresponding to Ai in Fig. 0] is the mirror image of a noncentral %-square 
p.d.f. with two degrees of freedom with respect to the y-axis. The random variable 
corresponding to the combined term of both Ai and A2 in (|12|) is the difference of two 
noncentral %-square random variables with two degrees of freedom. 
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Fig. El shows the cumulative tail probabilities as a function of Q-factor for 0res, 
defined as Q = (4>res — 0res)/o> RE s> where RE s and cr^ are the mean and variance 
of the residual phase noise shown in the Appendix. The cumulative tail probabilities 
calculated by numerical integration according to (fTT?|) is shown as circle, the model as 
the summation of a Gaussian random variable and the difference of two noncentral 
X-square random variables with two degrees of freedom of Fig. 0] is shown as solid 
lines, and the Gaussian assumptiorf^E is shown as dotted lines. From Figs. ^ and HJ 
the p.d.f. of 0res resembles a Gaussian p.d.f. with mean and variance from jHJand the 
Appendix. The residual Kerr effect phase noise of 0res can be modeled accurately as 
a Gaussian random variable, especially for the tail probabilities less than the mean. 
Even for the tail probabilities larger than the mean, the Gaussian model for 0res 
is better than that for ^nl- If the tail probabilities for above 10~ 5 is for interests, 
Gaussian approximation for 0res can be used. 

4. Conclusion 

The characteristic functions of Kerr effect phase noise, with and without the correction 
using the received intensity, are derived analytically as product of iV noncentral \- 
square characteristic functions with two degrees of freedom. The p.d.f.'s are calculated 
exactly as the inverse Fourier transform of the characteristic functions. The p.d.f. of 
the Kerr effect phase noise can be modeled as the convolution of a Gaussian p.d.f. and 
a noncentral ^-square p.d.f. with two degrees of freedom. Using the received intensity 
to correct for the phase noise, the p.d.f. of the residual Kerr effect phase noise can 
be modeled accurately as the convolution of a Gaussian p.d.f and two noncentral 
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X-square p.d.f.'s with two degrees of freedom. The Gaussian approximation of the 
residual Kerr effect phase noise is much better than that for Kerr effect phase noise. 

Appendix: Optimal Linear Compensator 

This appendix shows important results from jH]. The optimal scale factor to minim- 
imize the variance of 0res is 



a 



opt 



N+l A 2 + (2N + l)cr 2 /3 iV + 1 



2 A 2 + Na 2 2 

The variance of the residual nonlinear phase noise of (|T3j) is reduced to 



(21) 



a 



(/'RES 



(N — 1)N(N + 1)<7 2 



A 4 + 2Na 2 A 2 + (2N 2 + l)a 4 /3 



3(A 2 + Na 2 ) 



from that of the Kerr effect phase noise of 



(22) 



aL, = -N(N + l)o 2 



The mean of the Kerr effect phase noise (0) is 



(N + \)A 2 + {N 2 + N + l)a 2 



(23) 



0nl = N A + (N + 1)^ 



(24) 



The mean of the residual nonlinear phase noise is 



= 0NL-a op t(^ 2 + 2Na z 



(25) 
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List of Figure Captions 

Fig. [TJ The p.d.f. of both NL and 0res- 

Fig. El The p.d.f. of 0nl is the convolution of a Gaussian p.d.f. and a noncentral 
X-square p.d.f. with two degrees of freedom. 

Fig. El The cumulative tail probability of 0nl as compared with the model of Fig. |2] 
and Gaussian approximation. 

Fig. |U The p.d.f. of 0res is the convolution of a Gaussian p.d.f. and two noncentral 
X-square p.d.f. 's with two degrees of freedom. 

Fig. |3J The cumulative tail probability of 0res as compared with the model of Fig. |U 
and Gaussian approximation. 
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Fig. 1. The p.d.f. of both NL and 0res- 




Fig. 2. The p.d.f. of 0nl is the convolution of a Gaussian p.d.f. and a noncentral 
X-square p.d.f. with two degrees of freedom. 
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Fig. 3. The cumulative tail probability of 0nl as compared with the model of 
Fig. El and Gaussian approximation. 
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Fig. 4. The p.d.f. of 0res is the convolution of a Gaussian p.d.f. and two 
noncentral x-square p.d.f. 's with two degrees of freedom. 
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Fig. 5. The cumulative tail probability of 0res as compared with the model of 
Fig. |3] and Gaussian approximation. 
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